智能论文笔记

Evolutionary Multiparty Distance Minimization

Zeneng She , Wenjian Luo , Xin Lin , Yatong Chang , Yuhui Shi

分类：神经与进化计算 | 人工智能

2022-07-27

在进化多目标优化领域，决策者（DM）涉及相互冲突的目标。在现实世界中，通常存在多个DM，每个DM都涉及这些目标的一部分。提出了多方多目标优化问题（MPMOPS）来描绘拖把，其中涉及多个决策者，每个方都关注所有目标的某些目标。但是，在进化计算字段中，对mpmops的关注不多。本文基于距离最小化问题（DMP）构建了一系列MPMOP，它们的Pareto最佳解决方案可以生动地可视化。为了解决MPMOPS，新提出的算法OPTMPNDS3使用多方初始化方法来初始化总体，并带Jade2操作员生成后代。在问题套件上，将OPTMPNDS3与Optall，OptMPND和OptMPNDS2进行了比较。结果表明OPTMPNDS3与其他算法具有很强的可比性

translated by 谷歌翻译

Benchmark Functions for CEC 2022 Competition on Seeking Multiple Optima in Dynamic Environments

Wenjian Luo , Xin Lin , Changhe Li , Shengxiang Yang , Yuhui Shi

分类：神经与进化计算

2022-01-03

动态和多模式特征是两个重要的属性，并且在许多真实世界优化问题中广泛存在。前者说明了这些问题的目标和/或限制随着时间的推移而变化，而后者意味着在每个环境中存在多于一个最佳解决方案（有时包括接受的本地解决方案）。动态多峰优化问题（DMMOPS）具有这些特征，这些特征都在进化计算和群体智能领域中进行了多年，并吸引了越来越多的关注。解决这些问题需要优化算法在更改环境中同时跟踪多个Optima。因此，决策者可以根据他们的经验和偏好挑选每个环境中的一个最佳解决方案，或者当当前一个无法正常工作时，或者快速转向其他解决方案。这对决策者来说非常有帮助，特别是在面临改变环境时。在本次竞争中，给出了关于DMMOPS的测试套装，其中模拟了现实世界的应用程序。具体而言，该测试服采用8个多模函数和8种变化模式来构建24个典型的动态多模态优化问题。同时，还可以给出度量来测量算法性能，这考虑了所有环境中发现的最佳解决方案的平均数。促进动态多式化优化算法的发展将非常有帮助。

translated by 谷歌翻译

Further Improving Weakly-supervised Object Localization via Causal Knowledge Distillation

Feifei Shao , Yawei Luo , Shengjian Wu , Qiyi Li , Fei Gao , Yi Yang , Jun Xiao

分类：计算机视觉

2023-01-03

Weakly-supervised object localization aims to indicate the category as well as the scope of an object in an image given only the image-level labels. Most of the existing works are based on Class Activation Mapping (CAM) and endeavor to enlarge the discriminative area inside the activation map to perceive the whole object, yet ignore the co-occurrence confounder of the object and context (e.g., fish and water), which makes the model inspection hard to distinguish object boundaries. Besides, the use of CAM also brings a dilemma problem that the classification and localization always suffer from a performance gap and can not reach their highest accuracy simultaneously. In this paper, we propose a casual knowledge distillation method, dubbed KD-CI-CAM, to address these two under-explored issues in one go. More specifically, we tackle the co-occurrence context confounder problem via causal intervention (CI), which explores the causalities among image features, contexts, and categories to eliminate the biased object-context entanglement in the class activation maps. Based on the de-biased object feature, we additionally propose a multi-teacher causal distillation framework to balance the absorption of classification knowledge and localization knowledge during model training. Extensive experiments on several benchmarks demonstrate the effectiveness of KD-CI-CAM in learning clear object boundaries from confounding contexts and addressing the dilemma problem between classification and localization performance.

translated by 谷歌翻译

Optimization of Image Transmission in a Cooperative Semantic Communication Networks

Wenjing Zhang , Yining Wang , Mingzhe Chen , Tao Luo , Dusit Niyato

分类：人工智能 | 计算机视觉

2023-01-01

In this paper, a semantic communication framework for image transmission is developed. In the investigated framework, a set of servers cooperatively transmit images to a set of users utilizing semantic communication techniques. To evaluate the performance of studied semantic communication system, a multimodal metric is proposed to measure the correlation between the extracted semantic information and the original image. To meet the ISS requirement of each user, each server must jointly determine the semantic information to be transmitted and the resource blocks (RBs) used for semantic information transmission. We formulate this problem as an optimization problem aiming to minimize each server's transmission latency while reaching the ISS requirement. To solve this problem, a value decomposition based entropy-maximized multi-agent reinforcement learning (RL) is proposed, which enables servers to coordinate for training and execute RB allocation in a distributed manner to approach to a globally optimal performance with less training iterations. Compared to traditional multi-agent RL, the proposed RL improves the valuable action exploration of servers and the probability of finding a globally optimal RB allocation policy based on local observation. Simulation results show that the proposed algorithm can reduce the transmission delay by up to 16.1% compared to traditional multi-agent RL.

translated by 谷歌翻译

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs

Huaizheng Zhang , Yuanming Li , Wencong Xiao , Yizheng Huang , Xing Di , Jianxiong Yin , Simon See , Yong Luo , Chiew Tong Lau , Yang You

分类：机器学习

2023-01-01

New architecture GPUs like A100 are now equipped with multi-instance GPU (MIG) technology, which allows the GPU to be partitioned into multiple small, isolated instances. This technology provides more flexibility for users to support both deep learning training and inference workloads, but efficiently utilizing it can still be challenging. The vision of this paper is to provide a more comprehensive and practical benchmark study for MIG in order to eliminate the need for tedious manual benchmarking and tuning efforts. To achieve this vision, the paper presents MIGPerf, an open-source tool that streamlines the benchmark study for MIG. Using MIGPerf, the authors conduct a series of experiments, including deep learning training and inference characterization on MIG, GPU sharing characterization, and framework compatibility with MIG. The results of these experiments provide new insights and guidance for users to effectively employ MIG, and lay the foundation for further research on the orchestration of hybrid training and inference workloads on MIGs. The code and results are released on https://github.com/MLSysOps/MIGProfiler. This work is still in progress and more results will be published soon.

translated by 谷歌翻译

A Multi-Source Information Learning Framework for Airbnb Price Prediction

Lu Jiang , Yuanhan Li , Na Luo , Jianan Wang , Qiao Ning

分类：机器学习

2023-01-01

With the development of technology and sharing economy, Airbnb as a famous short-term rental platform, has become the first choice for many young people to select. The issue of Airbnb's pricing has always been a problem worth studying. While the previous studies achieve promising results, there are exists deficiencies to solve. Such as, (1) the feature attributes of rental are not rich enough; (2) the research on rental text information is not deep enough; (3) there are few studies on predicting the rental price combined with the point of interest(POI) around the house. To address the above challenges, we proposes a multi-source information embedding(MSIE) model to predict the rental price of Airbnb. Specifically, we first selects the statistical feature to embed the original rental data. Secondly, we generates the word feature vector and emotional score combination of three different text information to form the text feature embedding. Thirdly, we uses the points of interest(POI) around the rental house information generates a variety of spatial network graphs, and learns the embedding of the network to obtain the spatial feature embedding. Finally, this paper combines the three modules into multi source rental representations, and uses the constructed fully connected neural network to predict the price. The analysis of the experimental results shows the effectiveness of our proposed model.

translated by 谷歌翻译

Robust Domain Adaptive Object Detection with Unified Multi-Granularity Alignment

Libo Zhang , Wenzhang Zhou , Heng Fan , Tiejian Luo , Haibin Ling

分类：计算机视觉

2023-01-01

Domain adaptive detection aims to improve the generalization of detectors on target domain. To reduce discrepancy in feature distributions between two domains, recent approaches achieve domain adaption through feature alignment in different granularities via adversarial learning. However, they neglect the relationship between multiple granularities and different features in alignment, degrading detection. Addressing this, we introduce a unified multi-granularity alignment (MGA)-based detection framework for domain-invariant feature learning. The key is to encode the dependencies across different granularities including pixel-, instance-, and category-levels simultaneously to align two domains. Specifically, based on pixel-level features, we first develop an omni-scale gated fusion (OSGF) module to aggregate discriminative representations of instances with scale-aware convolutions, leading to robust multi-scale detection. Besides, we introduce multi-granularity discriminators to identify where, either source or target domains, different granularities of samples come from. Note that, MGA not only leverages instance discriminability in different categories but also exploits category consistency between two domains for detection. Furthermore, we present an adaptive exponential moving average (AEMA) strategy that explores model assessments for model update to improve pseudo labels and alleviate local misalignment problem, boosting detection robustness. Extensive experiments on multiple domain adaption scenarios validate the superiority of MGA over other approaches on FCOS and Faster R-CNN detectors. Code will be released at https://github.com/tiankongzhang/MGA.

translated by 谷歌翻译

ExploreADV: Towards exploratory attack for Neural Networks

Tianzuo Luo , Yuyi Zhong , Siaucheng Khoo

分类：机器学习

2023-01-01

Although deep learning has made remarkable progress in processing various types of data such as images, text and speech, they are known to be susceptible to adversarial perturbations: perturbations specifically designed and added to the input to make the target model produce erroneous output. Most of the existing studies on generating adversarial perturbations attempt to perturb the entire input indiscriminately. In this paper, we propose ExploreADV, a general and flexible adversarial attack system that is capable of modeling regional and imperceptible attacks, allowing users to explore various kinds of adversarial examples as needed. We adapt and combine two existing boundary attack methods, DeepFool and Brendel\&Bethge Attack, and propose a mask-constrained adversarial attack system, which generates minimal adversarial perturbations under the pixel-level constraints, namely ``mask-constraints''. We study different ways of generating such mask-constraints considering the variance and importance of the input features, and show that our adversarial attack system offers users good flexibility to focus on sub-regions of inputs, explore imperceptible perturbations and understand the vulnerability of pixels/regions to adversarial attacks. We demonstrate our system to be effective based on extensive experiments and user study.

translated by 谷歌翻译

Depression Diagnosis and Analysis via Multimodal Multi-order Factor Fusion

Chengbo Yuan , Qianhui Xu , Yong Luo

分类：人工智能 | 计算机视觉

2022-12-31

Depression is a leading cause of death worldwide, and the diagnosis of depression is nontrivial. Multimodal learning is a popular solution for automatic diagnosis of depression, and the existing works suffer two main drawbacks: 1) the high-order interactions between different modalities can not be well exploited; and 2) interpretability of the models are weak. To remedy these drawbacks, we propose a multimodal multi-order factor fusion (MMFF) method. Our method can well exploit the high-order interactions between different modalities by extracting and assembling modality factors under the guide of a shared latent proxy. We conduct extensive experiments on two recent and popular datasets, E-DAIC-WOZ and CMDC, and the results show that our method achieve significantly better performance compared with other existing approaches. Besides, by analyzing the process of factor assembly, our model can intuitively show the contribution of each factor. This helps us understand the fusion mechanism.

translated by 谷歌翻译

An Efficient Hierarchical Kriging Modeling Method for High-dimension Multi-fidelity Problems

Youwei He , Jinliang Luo

分类：机器学习

2022-12-31

Multi-fidelity Kriging model is a promising technique in surrogate-based design as it can balance the model accuracy and cost of sample preparation by fusing low- and high-fidelity data. However, the cost for building a multi-fidelity Kriging model increases significantly with the increase of the problem dimension. To attack this issue, an efficient Hierarchical Kriging modeling method is proposed. In building the low-fidelity model, the maximal information coefficient is utilized to calculate the relative value of the hyperparameter. With this, the maximum likelihood estimation problem for determining the hyperparameters is transformed as a one-dimension optimization problem, which can be solved in an efficient manner and thus improve the modeling efficiency significantly. A local search is involved further to exploit the search space of hyperparameters to improve the model accuracy. The high-fidelity model is built in a similar manner with the hyperparameter of the low-fidelity model served as the relative value of the hyperparameter for high-fidelity model. The performance of the proposed method is compared with the conventional tuning strategy, by testing them over ten analytic problems and an engineering problem of modeling the isentropic efficiency of a compressor rotor. The empirical results demonstrate that the modeling time of the proposed method is reduced significantly without sacrificing the model accuracy. For the modeling of the isentropic efficiency of the compressor rotor, the cost saving associated with the proposed method is about 90% compared with the conventional strategy. Meanwhile, the proposed method achieves higher accuracy.

translated by 谷歌翻译